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Abstract 

When mutation rates are low, natural selection remains effective, and increasing the 
mutation rate can give rise to an increase in adaptation rate. When mutation rates are 
high to begin with, however, increasing the mutation rate may have a detrimental effect 
because of the overwhelming presence of deleterious mutations. Indeed, if mutation 
rates are high enough: 1) adaptation rate can become negative despite the continued 
availability of adaptive and/or compensatory mutations, or 2) natural selection may 
be disabled because adaptive and/or compensatory mutations - whether established 
or newly-arising - are eroded by excessive mutation and decline in frequency. We ap- 
ply these two criteria to a standard model of asexual adaptive evolution and derive 
mathematical expressions - some new, some old in new guise - delineating the mu- 
tation rates under which either adaptive evolution or natural selection is neutralized. 
The expressions are simple and require no a priori knowledge of organism- and/or 
environment-specific parameters. Our discussion connects these results to each other 
and to previous theory, showing convergence or equivalence of the different results in 
most cases 
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1. INTRODUCTION 



Mutations are indiscriminate alterations of highly complex organisms and, as 
such, are much more likely to be harmful than beneficial. For an individual or- 
ganism, therefore, an increase in the overall rate of mutation should be detrimen- 
tal. In a population of organisms, however, natural selection disproportionately 
favors beneficial mutations and the net effect of increasing the overall mutation 
rate is thus less clear. 

1.1 Previous studies 

Generally speaking, the population-level effects of increasing the mutation rate 
have been studied separately under two artificial assumptions: the absence of 
beneficial mutations, and infinite population size. Only a handful of studies have 
relaxed both assumptions. 

1.1.1 Absence of beneficial mutations 

When beneficial mutations are assumed to be absent and population size is fi- 
nite, fitness will undergo a slow but steady decline, because of the sluggish but 
largely irreversible accumulation of deleterious mutations. This process is es- 
pecially pronounced in asexual populations, and it was in this context that the 
process was first described by Muller ( PHI) and later dubbed "Muller's ratchet" 
( fl9|) and formalized by Haigh (33). Under the relentless accumulation of delete- 
rious mutations, fitness will decline monotonically. Most of the subsequent work 
on Muller's ratchet has focused on the rate of the ratchet, different factors affect- 
ing this rate, and in particular factors or conditions that can cause this rate to 
become negligible (i.e., that halt the ratchet) (J21 HH EU ESI E21 ESI ESI IH1 ETJ) . 
Increasing the genomic mutation rate can only accelerate Muller's ratchet. 
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1.1.2 Infinite population size 

When population size is assumed to be infinite, populations whose adaptation 
is constrained - i.e., populations in which beneficial mutations can occur but 
that have a maximum attainable fitness - will eventually achieve an equilib- 
rium fitness distribution shaped by the largely opposing forces of mutation and 
natural selection. Above a critical mutation rate dubbed the "error threshold" 
( 1171 HH]), this distribution becomes remarkably flat, indicating that a genotype's 
equilibrium frequency is essentially independent of its fitness. This conversion 
to a state of random fitness dispersion is reminiscent of a phase transition and, 
mathematically at least, the two are equivalent. The error threshold has been 
studied extensively under many different conditions that include recombination 
and departures from random mating ()5l l4"5 l l4*6il . viral complementation ( )4"9l) . and 
different static and dynamic fitness landscapes (J6j [25j SHJ SSI [52j [531 EH 155]) . 

1.1.3 Extinction 

The two classes of models described above - Muller's ratchet and the error 
threshold - encompass most previous characterizations of mutational degrada- 
tion processes. Additionally, some work has superimposed demography onto both 
Muller's ratchet fl2H S21 EE]) and error threshold OH S3]) models, finding a positive 
feedback between these processes and demographic decline toward extinction. 

1.1-4 Finite populations with beneficial mutations 

A few studies have addressed the effect of increasing the mutation rate when 
the two foregoing assumptions are relaxed - i.e., when beneficial mutations are 
accounted for and populations are finite. Under these more realistic conditions, 
the fitness decline due to Muller's ratchet can be canceled out or even reversed 
by beneficial mutations, resulting in unchanging or increasing fitness. The effect 
of beneficial mutations on Muller's ratchet has been explored previously (EJ I5U|) ; 
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these studies focused on how the effects and relative fractions of beneficial vs. 
deleterious mutations would affect the adaptation rate and whether that rate 
was positive or negative. In the present study, we focus on how the genomic 
mutation rate affects the progress of adaptive evolution and the effectiveness of 
natural selection. 

1.2 Present study 

1.2.1 Neutralizing adaptive evolution 

When genomic mutation rate is low to begin with, an increase in this rate may 
be advantageous: the increased production of deleterious mutations can be of 
disproportionately small consequence because natural selection tends to eliminate 
deleterious mutations from the population, whereas the increased production of 
rare beneficial mutations can be of disproportionately large consequence because 
natural selection can cause the fixation of beneficial mutations from which the 
entire population benefits. Thus if a population's overall mutation rate is low to 
begin with, then an increase in the mutation rate can increase the rate at which 
beneficial mutations are fixed, thereby increasing the adaptation rate, where 
adaptation rate is defined as rate of increase in mean fitness. In other words, 
a positive correlation can exist between genomic mutation rate and adaptation 
rate. 

When genomic mutation rate is high to begin with, however, an increase in 
this rate may be disadvantageous because of excess deleterious mutation. While 
the consequence of deleterious mutations is still disproportionately small, it is 
less so at high mutation rates, because deleterious mutations can be produced 
faster than natural selection can remove them. At high mutation rates, therefore, 
a negative correlation can exist between genomic mutation rate and adaptation 
rate. 

The foregoing considerations indicate a non-monotonic relationship between 
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mutation rate and adaptation rate, a relationship confirmed by simulation (Figs. 
[l]and[2]). In the present manuscript, we are interested in finding critical genomic 
mutation rates above which adaptation rate becomes negative. It seems reason- 
able to speculate that a negative adaptation rate, if sustained, would ultimately 
result in extinction. 

1.2.2 Neutralizing natural selection 

Evolution by natural selection proceeds through the appearance and subsequent 
fixation of adaptive and/or compensatory mutations. When mutation rate is 
low, virtually all adaptive and/or compensatory mutations produced have fix- 
ation potential: all of them have the possibility, at least, of enduring the first 
few generations of random sampling (surviving genetic drift (34)), outcompeting 
other adaptive and/or compensatory mutations (surviving the Hill- Robertson 
effect (136|) or clonal interference (T2TT)), and spreading to fixation. This is be- 
cause, with low mutation rates, progress to fixation is relatively unhindered by 
deleterious mutations. 

As mutation rate increases, however, the fixation potential of adaptive and/or 
compensatory mutations is reduced: each such mutation founds a lineage whose 
growth is increasingly eroded by the accumulation of deleterious mutations. As 
mutation rate continues to increase, a point may be reached at which adaptive 
and/or compensatory mutations lose their fixation potential altogether, thereby 
neutralizing natural selection. We explore three particularly telling indicators 
that this point has been reached: 1) the fittest genotype in the population (e.g., 
an adaptive mutant) decreases in frequency, 2) the fittest genotype in the popu- 
lation has an equilibrium frequency (i.e., a mutation-selection balance frequency) 
very close to zero, and 3) a newly-arising fittest genotype is ultimately doomed 
to extinction with probability one. 
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1.2.3 A key innovation: dynamical insufficiency 

In many of the previous investigations of mutational degradation processes, analo- 
gies are drawn to physical processes not least of which is the phase transition 
analogy. But the analogous physical processes typically occur on short time- 
scales during which the relevant parameters remain constant and convergence 
to equilibria occurs rapidly. This context affords the luxury of dynamically suf- 
ficient models and applicability of their steady-state analyses. In evolutionary 
biology, however, time scales are longer, relevant parameters cannot reliably be 
assumed to remain constant, and equilibria may rarely, if ever, be achieved. In the 
face of such long-term uncertainty, predictive accuracy seems unlikely; neverthe- 
less, dynamically insufficient models may provide short-term predictive accuracy. 
Fisher's "fundamental theorem of natural selection" accurately predicts the evo- 
lution of fitness over the course of a single generation; by sacrificing dynamical 
sufficiency, this theorem achieves short-term predictive accuracy. Some of the 
conditions that we derive here (the more useful conditions) employ variations 
of this approach; they depend on statistical properties of the population that, 
by virtue of their intermediate dynamical sufficiency, absorb contingencies and 
other surprises that are so characteristic of the biological world (see Discussion) 
and thereby may subsume many previous results that individually treat an ar- 
ray of different complexities and were derived under the purview of dynamical 
sufficiency. 

1.2.4 Our default application 

The discovery and development of the error threshold sparked the imaginations 
of virologists, whose efforts to clear viral infections using antiviral drugs are 
bedeviled by the high mutation rates of many viruses. If mutation rate could be 
elevated even further through mutagenesis, error threshold theory suggested that 
viral populations might be driven extinct, thereby, in a sense, beating the virus 
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at its own game (pfl QUI H2J EEH SHI)- Partly because of this historical context, 
we have adopted this particular application as our "default" application: unless 
otherwise stated, we have in mind the general aim of eradicating an unwanted 
population through mutagenesis and the inequalities we derive reflect this aim. 

1.2.5 Outline of the present study 

In the present study, we independently apply the two criteria described above to 
a standard, general model of fitness evolution in order to derive the conditions 
under which adaptive evolution and natural selection are neutralized. The condi- 
tions that we derive from criterion 1 range from sufficient to sufficient and neces- 
sary; however, it is the intermediate condition - called "sufficient and somewhat 
necessary" - that we believe is the most novel and perhaps the most practical. 
We apply criterion 2 both to a population in which the fittest genotype is resident 
(recovering the classical error threshold result in a new guise that lends itself to 
an alternative and perhaps more useful interpretation) and to one in which the 
fittest genotype is a newly-arising beneficial mutant. 

2. RESULTS 

2.1 The model 

We employ a standard model of adaptive evolution of an asexual population 
in which a genotype or class increases in log-frequency as the fitness of that 
genotype or class minus the mean fitness of the population. Mutation occurs 
among genotypes or classes as a diffusion process that is strongly biased in favor 
of deleterious mutations. Mathematical formulations of this model are given in 
the Appendix. 

In what follows, we use the bracket notation in addition to the over-bar no- 
tation to denote expectation, and we use subscripted bracket notation to denote 
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expectation with respect to the subscripted variable; for example, x = (x) denotes 
mean fitness, and (x) t denotes mean fitness averaged over time. The models we 
employ are continuous in time and thus our measure of fitness x corresponds to 
the log of fitness w used in classical population genetics (discrete-time) models. 

2.2 Criterion 1: Adaptive evolution is neutralized 

In this section, we employ a formulation of our model that is continuous in both 
time and fitness. We ask under what conditions adaptation will move backwards, 
i.e., under what conditions population mean fitness will decrease in spite of an 
inexhaustible supply of beneficial mutations. 

2.2.1 Sufficient and sufficient/necessary conditions 

Adaptive evolution is neutralized when the long-term tendency of absolute fitness 
is to decrease, despite the availability of adaptive and/or compensatory muta- 
tions. A sufficient but not necessary version of this condition imposes dx/dt < 
at all times, where x = (x) is population mean fitness. The necessary and suffi- 
cient version of this condition is {dx/dt) t < 0. These conditions imply (Appendix) 
that adaptive evolution will be neutralized and fitness will in fact decline if the 
relation 



holds persistently (sufficient) or at least on average (sufficient and necessary), 
where a x is variance in fitness, U is genomic mutation rate, and 5 X is the effect 
of mutation on fitness (a random variable) and (S x ) is its average. (While this 
expression is given in terms of fitness, an equivalent expression is derived in terms 
of a fitness-related phenotype in the electronic supplemental material, or ESM.) 
If the effects of beneficial and deleterious mutations are considered separately, 
then (S x ) = }'B m B — fD m D, where f B and f B are the fractions of all mutations 
that are deleterious and beneficial, respectively; m D and m B are the mean ef- 




(2.1) 
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fects of deleterious and beneficial mutations on fitness, respectively. Biological 
considerations overwhelmingly support /dttcd /b^b, so the left-hand side of 



( plj ) will most likely be positive. By some estimates fil l l87 | W7 \ IHH j l5T | 156]) . f B 
can be surprisingly high, however: 1) this does not necessarily imply high val- 
ues of /bibb (B71) . and 2) it seems unlikely that fs^B would ever exceed fo m D 
simply because the ways to damage a highly complex entity (such as a living 
organism) far outnumber the ways to improve it. In the very unlikely case that 



fs m B > fo m Di condition (2.1) would present a contradiction, and fitness decline 
would be impossible regardless of U. 



Critical mutation rate can be a moving target. As evidenced by (2.1 ), the crit- 
ical mutation rate required to neutralize adaptive evolution is a function of the 
fitness variance. Increasing the mutation rate, however, will often cause a subse- 
quent increase in fitness variance, in turn increasing the mutation rate required to 
satisfy ( 2.1[ ). In fact, classical population genetics (accounting for deleterious mu- 
tations only), and work by Rouzine et al. (JUHHD and Goyal et al. (1BD1) (accounting 
for beneficial and deleterious mutations) all indicate that, for low to moderate 
mutation rates, the fitness variance should tend toward —U (5 X ) following a per- 
turbation in fitnesses and/or mutation rate. This suggests that an adjustment 
in the mutation rate (perhaps through increasing the dose of a mutagen, for ex- 
ample) to satisfy the condition —U (5 X ) > a 2 will be followed by an increase in 
fitness variance such that a x — > —U{S X ), thus necessitating a further increase 
in U in order to maintain the relation —U (5 X ) > o~ 2 . Frank and Slatkin (1231) 
have pointed out that the tendency o 2 x — > —U (5 X ) represents mutation-selection 
balance (in fact, they mention this in the context of phenotypic evolution but the 
same notion applies). Figuratively, the condition —U (S x ) > a 2 may be thought 
of as a mutation rate that persistently tips the balance in favor of mutation; 
alternatively, it may be thought of as a mutation rate persistently high enough 
to prevent convergence to mutation-selection balance. As U is increased to main- 
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tain —U (5 X ) > a x in a continually adapting population, a x will eventually reach 
a maximal value (due to finite population size) and, at this point, the value of 
U need not increase further to satisfy —U (5 X ) > o~ 2 max . In Fig. [3]B, the genetic 
variance in fitness is measured in simulated populations every 100 generations 
and —U (S x ) is set at ten percent above a 2 , thereby maintaining —U (5 X ) > a 2 . 
For a long time, the positive feedback between mutation rate and fitness variance 
results in escalating adjustments to the mutation rate; after some time, however, 
the variance appears to achieve a maximum so that the mutation rate required 
for continued fitness decline levels off. 



2.2.2 Sufficient and somewhat necessary condition 

So far, we have derived conditions that lie at opposite ends of the spectrum from 
sufficiency to sufficiency- and- necessity. From a practical standpoint, however, 



both are of limited utility. Condition (2.1) insures declining fitness only for the 



current generation. The sufficient condition is that this relation hold persistently, 
but this condition may be frustratingly elusive because it fails to anticipate the 
change in fitness variance that typically follows an adjustment to the mutation 
rate. For this condition to be enforced in practice, therefore, frequent measure- 
ments of a 2 would be required, followed by adjustments in U (for example, by 



increasing the dose of a mutagen), if needed, to maintain the relation (2.1 ) (as in 



Fig. lM,.B). In practice, therefore, the sufficient condition amounts to a rather 



inconvenient protocol. The sufficient-and-necessary condition, that (2.1) hold on 
average, requires long-term future knowledge of population fitnesses that is gen- 
erally not attainable in practice. Here, we derive conditions that lie somewhere 
in the middle of the spectrum from sufficiency to sufficiency-and-necessity and 
that have increased practical applicability. 

To this end, we temper our sufficient and necessary condition: instead of 
requiring that the long-term average gradient oppose selection, we now require 
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only that the medium-term average gradient oppose selection. We will denote 
this intermediate condition as (dx/dt) r t < 0, where r denotes the number of future 
generations over which to take the average. In order to enforce this condition, 
however, one needs a way to predict the near-future course of evolution; an 
algorithm for doing this is outlined in (28). There, it is shown that prediction of 
the near-future course of evolution can be achieved by a time-discretization of a 
hierarchy of cumulant equations. 

Using the equations for fitness evolution derived in (1281) and imposing (dx/dt) r t < 
0, the condition under which adaptive evolution is neutralized may be written: 



where the future fitness variances (or second cumulants), ^2{ T ) — < J 1{ T )i are com- 
puted from the set of recursions k*(t + 1) = «i(r) + «j + i(r) + Urrii as outlined 
in ( 128]) (also, see Appendix); Ki(r) denotes the i th cumulant at generation r; 
t = denotes the present generation (called "now"), r = 1 denotes one gen- 
eration from now, r = 2 denotes two generations from now, etc., and r is the 
"predictive reach", i.e., r is how many generations into the future the algorithm 
can be trusted to predict. An alternative condition that errs conservatively is: 
— U (5 X ) > max(K 2 (0), « 2 (1), K 2 (r)). (See ESM for equivalent phenotypic ex- 
pressions.) The appearance of these equations is deceptively simple because as U 
is changed, the predictions for Kz(t) will change, i.e., the equations look explicit 
when in fact they are implicit for U. (They are implicit for U because a cer- 
tain degree of circularity is required by their intermediate dynamical sufficiency, 
which anticipates future changes in a 2 x without requiring knowledge of organis- 
mal and environmental parameters; in practice, this fact only imposes the slight 
inconvenience of having to use an iterative procedure in the calculations.) 




r-l 




T=0 
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2.3 Criterion 2: Natural selection is neutralized 

The approach that derives from this criterion takes its lead from statistical 
physics, where an "order parameter" quantifies the degree of order present in 
the system at hand. Order in an evolving population is brought about through 
the action of natural selection on genetic variation. In evolution, a natural choice 
for an order parameter is the frequency of the fittest genotype. If natural selec- 
tion is operational, the fittest genotype should persist at reasonable frequency 
despite recurrent mutation away from this genotype, and this frequency is thus 
indicative of the amount of order present in the population. As mutation rate in- 
creases, the frequency of the fittest genotype will decrease, indicating a decrease 
in the overall order present. At a sufficiently high mutation rate, the amount of 
order will approach zero. 

2.3.1 Sufficient condition 

Here, we have in mind a population that is heterogeneous and that is predom- 
inated by a fittest genotype whose frequency is uq. Our sufficient condition is 
derived by finding the mutation rate that causes the frequency of the fittest 
genotype to decrease relative to its mutational neighbors: du /dt < 0, persis- 
tently. Solving for the mutation rate that insures this inequality gives rise to the 
condition: 

Uf D > (x - x)(l/(l - uq) - l/Lu )-\ (2.3) 
where L is the size of the deleterious genome, xq is the fitness of the fittest 

L L 

genotype; x = x j^ji an d x — J] XjUj, from which we have the useful relation 
(1 — u )(x — x) = x — x. In a finite population, xq is the maximum fitness found 
in the population, and x is the average fitness of everybody else: x = ^ J2i & s • r *> 
where S is the subset of the population that has fitness less than the maximum 
and #S is the number of individuals in that subset.. If it is the case that the 
population is finite and L ^> N, then the term 1/Luq < N/L 0, giving rise 
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to the condition: Ufn>{l— u )(x Q (reported in Table 1). In 

our simulations, we assume an infinite genome (L — > oo) and finite population 
size; under these conditions, Ufo > xo — x is exact. In a continually adapting 
population, uq will be small most of the time, in which case this expression may 
be used interchangeably with: U fn ^ x — x (employed in Fig. [3]). 

2.3.2 Sufficient and necessary conditions 

(1) Mutational degradation of an established fittest genotype. Here, we have in 
mind a population that is heterogeneous but that has been predominated by a 
fittest lineage for some time. To determine the amount of order in this population, 
we compute its order parameter, uq: the equilibrium frequency of this fittest 
lineage relative to its mutational neighbors (genotypes that differ from the fittest 
lineage by mutation). We are especially interested in what happens to the order 
parameter as mutation rate increases. 

Analysis of the evolutionary model at equilibrium reveals that indeed the 
order parameter w decreases with increasing mutation rate (Appendix). The 
approach of Uq toward zero as U increases is characterized by an inflection point 
that becomes increasingly sharp as deleterious genome size L increases. The 
mutation rate at which the inflection point occurs is found by solving for the 
critical mutation rate U c that satisfies d 3 Uo/dU 3 = 0. As L increases, U c fn = 
\x c L — > xq — x, where fo is again the fraction of mutations that are deleterious, 
and \i c is the critical point mutation rate. From this result, natural selection may 
reasonably be expected to be neutralized when mutation exceeds the critical rate: 

Uf D >x -x. (2.4) 

This is the classical "error threshold" result in new guise. It is an equilibrium 
result and its practical use would therefore require knowledge of long-term future 
states of the population. 
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The equilibrium frequency of the fittest class or genotype at the "error thresh- 
old", while greatly reduced, is still greater than the frequencies of neighboring 
genotypes: at /i — /i c , the fittest genotype has frequency uq ~ \ + \ whereas 
mutational neighbors have frequency Ui < \. This stands in contrast to common 
notions about the error threshold as creating a competitive reversal that leads to 
the subordination and/or loss of the fittest genotype. In a finite population, the 
fittest genotype will be deterministically lost from the population at the error 
threshold only if N < \/L. To put this condition in perspective, we consider a 
strain of Escherichia coli that has a genome of length L pa 4.6 x 10 6 base pairs; 
if we make the very conservative assumption that mutation at any position on 
the genome will affect fitness, then any population larger than y/Z pa 2145 will 
deterministically retain the fittest genotype at the error threshold. 

Despite the persistence and continued dominance of the fittest genotype, the 
error threshold nevertheless marks a point at which the frequencies of the dif- 
ferent genotypes are so severely eroded by mutation that their frequencies are 
clearly not indicative of their fitness. This disabling of natural selection is appar- 
ent in the relation: cov(u, x)\^ =llc ~ s/L, where s ~ — (S x ). For large genomes, 
therefore, the covariance between fitness and frequency - an indicator of the 
efficacy of natural selection - is very small at the error threshold (but still posi- 
tive). Additionally, the extent to which natural selection has become ineffective 
is reflected by the amount of disorder present in the equilibrium population; a 
standard index of disorder is the Shannon entropy which, at the error threshold, 
is approximately equal to log 2 L. 

(2) Mutational degradation of a newly-arising fittest genotype. Here, we have 
in mind an asexual population that is heterogeneous and in which a benefi- 
cial mutation emerges. This mutation creates a newly-arising "fittest genotype" 
whose subsequent growth depends on the persistence of that genotype within 
the growing lineage, despite recurrent mutation away from that genotype. The 
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newly arising fittest genotype has fitness x , and the rest of the population has 
average fitness x, as before. In a single generation, the new lineage grows by 
a factor R = e x °~ x = e ( x o- x ){ l ~ u o) _ Accumulation of deleterious mutations oc- 
curs most rapidly early in the growth of a lineage ( 12 ip . when uq ~ 0, suggesting 
the approximation R m e x °~ x . Previous studies show that genomic mutation 
rates that cause the degradation of the newly-arising fittest genotype must sat- 
isfy U fr> > log R OH |9] |2T]) . The extinction of a newly-arising fittest genotype is 
therefore predicted to occur when: 

Uf D >x -x (2.5) 



(Compare to (2.4).) This result was originally derived for an independent asexual 
population growing without bound at discrete-time rate R (|2~TI) and was later re- 
derived in a way that more explicitly allowed for purifying selection and dubbed 
the "lethal mutagenesis" threshold (jBl E]) for unboundedly growing viral and 
bacterial populations. This result should also apply, however, to lineages growing 
within a population as a consequence of positive relative fitness (xq — x > 0). 
Finite population size restricts applicability to lineages that begin to decline in 
frequency before being affected by population size constraints, which seems likely 
to account for many such lineages when at or near the critical mutation rate (but 
see (26)). Those lineages that do achieve higher frequencies are likely to become 
fixed in the population, in which case the relevant condition was derived in the 



previous subsection: Ufn > x — x (condition (2.4)). It thus seems reasonable to 



conjecture that whatever the maximum frequency achieved by the new lineage, 



the condition is well approximated by (2.5). 
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3. DISCUSSION 



3.1 Practical use of the equations 

Why are accurate predictions desireable? On the surface, it seems that if one 
has the ability to increase mutation rate, perhaps through the use of a chemical 
mutagen, then to drive a population extinct, one needs only to increase the mu- 
tation rate by a large amount, perhaps by administering a high dose of mutagen. 
The problem with this approach is that, in real populations, variation in muta- 
tion rate is inevitable and resistance to a mutagen can appear. A large increase 
in the mutation rate can create strong selection pressure for a lowered mutation 
rate, and a reduction in the mutation rate may thus evolve in short order. Our 
own work on this (in prep.) has shown that Escherichia coli evolves resistance 
to a nucleoside analogue mutagen, administered at a fairly high dose, in just a 
few generations. If one could increase the mutation rate to a level that is high 
enough to cause extinction, but not too high, selection for resistance could in 
principle be reduced considerably and the evolution of resistance might be pre- 
vented. Accurate predictions for the critical mutation rate required for extinction 
may therefore aid in the practical implementation of chemical mutagenesis, and 
the evolution of resistance might be prevented. Indeed, our equations and simu- 
lations would suggest an improved protocol in which a mutagen is administered 
in incrementally increasing dose (reflected in Fig. [3]). 

Timeframe of applicability. The equations derived here are similar in their 
generality and robustness; however, they differ amongst themselves in one aspect 
of practical relevance, namely, their timeframe of applicability. Under criterion 
1, this timeframe ranges from short-term (sufficient) to medium-term (sufficient 
and somewhat necessary) to long-term (sufficient and necessary). Under crite- 
rion 2, the timeframe is short-term (sufficient) or long-term (sufficient and neces- 
sary). The long-term results might potentially be applied approximately using a 
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running-average approach that is necessarily somewhat arbitrary, but technically 
correct application of these results requires information about long-term future 
states of the population that would not be obtainable in practice. When the mu- 
tation rate is adjusted according to fitness measurements from a population taken 
in real time, the correct equations to use are the short-term and medium-term 
conditions. These conditions are applied in simulation studies of which repre- 
sentative runs are presented in Fig. [3] there, adjustments to the mutation rate 
are made in real time and the short-term and medium-term conditions derived 
under criterion 1 (labeled "variance" and "variance-projection" thresholds, re- 
spectively) perform well, whereas the conditions derived under criterion 2 (error 
threshold) appear to be less well-suited to such real-time application. 

Adaptation in a static environment. A population adapting in a static envi- 
ronment typically has a limited, non-renewable supply of available beneficial mu- 
tations (barring intransitive interactions). As the population adapts, therefore, 
the supply of available beneficial mutations is slowly depleted; as a consequence, 
mean fitness may increase and subsequently decrease, and fitness variance may 
also change over time, thereby changing the minimal mutation rate prescribed 



by (2.1). This is shown schematically in Fig. |4j in static environments and, 
generally speaking, in environments where the supply of beneficial mutations can 
change over time, adaptive evolution or natural selection may be neutralized not 
as a result of changes in the mutation rate (i.e., changes in —U (S x )) but as a 
result of changes in the requirements on the mutation rate (i.e., changes in a x ). 
This is the basis of an illustrative application to immunology that is described in 
the ESM. 



3.2 Connections to previous theory 

Fisher and Kimura. Fisher's "fundamental theorem of natural selection" states 
that, when x is defined as additive genetic fitness, dx/dt = o 2 x quite generally ( 12TH 
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. Fisher's theorem shows that this particular component of fitness can only 
increase (variance is a non-negative quantity); consequently, this component of 
fitness has accurately been called the "adaptive engine" of natural selection (31). 
This component of fitness, however, must be conserved over time, for example, in 
the transmission from parent to offspring for Fisher's theorem to apply. If there 
is a component of fitness that is not conserved, then to find the change in total 
mean fitness of the population (conserved and non-conserved), one must use the 
chain rule: dx/dt = J (^u(x,t) + x-^u(x,t)) dx which, together with ^u(x,t) 
: .; — x)u(x, t), yields dx/dt = a x + (dx/dt) x - a fact pointed out by Kimura (38). 
(Note the subscript x, indicating that here the average is taken over fitness or, 
equivalently, over individuals in the population.) As a general rule, (dx/dt) x will 
be negative, because random alterations in the organism or its environment are 
more likely to decrease the organism's fitness than to increase it. This simple 
calculation illustrates the fact that Fisher's fundamental theorem applies only to 
that subset of the population whose additive genie fitness is conserved over the 
period of time in question: only for this particular subset of the population are 
we guaranteed that mean fitness will not decrease. 

Applying our criterion 1 to Kimura's equation yields a more general condition 
for the disabling of adaptive evolution: 

- (dx/dt) x > a 2 x (3.1) 

must hold persistently or at least on average. Here, the mechanism of change in 
individual fitnesses over time is not specified. If we specify that the mechanism 
of change is mutation, then (dx/dt) x = U J s S x g(5 x ,t) = U (5 X ), where g(5 x ,t) is 



the distribution of mutational effects on fitness, and we recover Eq. (2.1). 



The error threshold. As previously stated, Eqs. (2.4) and (2.5) are the error 
threshold in new guise. The original work on the error threshold due to Eigen (fTTl) 
derives a minimum value for the "quality factor" - the probability of complete 
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fidelity of replication - that is needed to maintain the efficacy of natural selection 
and thus to support life. This minimum value is given by J2 m i n = (£/k^m + @m — 
^ k7 L m )/^/ m (Eq. 11-45 in flTTl) ). where srf m and 3i m are the birth and death rates 
of the fittest genotype (the "master sequence"), respectively, ^4^ m and J^ m 
are the mean birth and death rates of the rest of the population (individuals 
that do not carry the "master sequence"). The quantity s^ rn {l — £} m %n) is the 
expected number of deleterious mutants produced by a single replication event 
of the fittest genotype. This quantity, in our notation, is Ufo] furthermore, 
srf m — *3) m is equivalent to our xq and — S>k^m is equivalent to our x. Eigen's 
result may thus be rewritten in our notation as requiring U fo < xq — x for the 
effectiveness of natural selection to be maintained, or conversely, Ufn> Xq — X 
for natural selection to be neutralized. 

In work subsequent to Eigen's original publication, the varied presentations 
of his error threshold result are usually rearrangements of this simple expression: 
Qmin = °~ X i where q min is the minimum per-nucleotide replication fidelity required 
for survival (q m in = 1 — At c ), L is the length of the deleterious genome, and a 
is the "superiority parameter", defined as a = 1/J2 m i n . Rewriting reveals an 
interesting biological requirement: logg^ in ~ — fi c L gives rise to the relation: 

H c ~ (something) /L. (3.2) 

This inverse relation between fi c and L intrigued its discoverers to the extent 
that the "something" was all but ignored. It was since discovered, however, that 
observations of fiL are surprisingly constant across microbial taxa (1T5| ll~6|) (in- 
deed, it has been conjectured that this is the case precisely because of the inverse 
relation between /i c and L). The relative constancy of fiL across taxa suggests 
that the "something" may in fact be quite relevant to the fate of a population; 
furthermore, L will probably not change on time scales pertinent to extinction- 
by-mutation. These considerations shift the focus to a. Its name together with 
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its traditional presentation obfuscates the fact that a is a population-dependent 
quantity and not an organism-dependent parameter. Our new presentation of this 
old result shifts the emphasis from critical point mutation rate \x c vs. genome 
length L to critical genomic mutation rate fi c L vs. the myriad biological, eco- 
logical and environmental factors that are not explicitly part of the equation but 
that are absorbed by the quantity a or, in our formulation, xq — x. 

3.3 Connections among results presented here 

Our first criterion is the sustained decline of absolute fitness, whereas our second 
criterion is the inefficacy of natural selection. We now show that, despite these 
perhaps disparate criteria, the resulting conditions for extinction connect through 
classical population genetics. Criterion 1 gives rise to the condition —U (5 X ) > a 2 , 
and criterion 2 gives rise to the condition Ufr> > Xq — x. Our comparison of 
these two results proceeds by multiplying both sides of the second condition by 
rriD to obtain UfomD > {xo — x)mD- First, we note that the left-hand side 
is Ufom-D ~ —U (5 X ) because most mutations are deleterious. Next, we focus 
on the right hand side of the inequality. As a population approaches the error 
threshold (i.e., as this inequality approaches equality), the size of the fittest class 

N 

approaches zero and it is the case that J2ies Xi ~* ~k S x u or % ~ ^- The 

i=l 

quantity Xq — x is known in classical population genetics as the genetic load, and 
it is known to converge to the deleterious mutation rate Ufu. Furthermore, it is 
known that if mutations are assumed to have a fixed deleterious effect, mo, then 
the number of accumulated mutations becomes Poisson distributed with mean 
Ufn/'nT'D The variance in number of accumulated mutations is the same as 
the mean, and the variance in fitness is therefore a 2 x = (C//d/ttid) tu 2 d = UfnTno- 
As the error threshold is approached, therefore, the right-hand side becomes 
(x - x)m D -> (x - x)m D = Uf D m D = o 2 x . 
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3.4 Borrowed robustness 

Fisher's fundamental theorem of natural selection is known to be extraordinarily 
accurate in spite of numerous complexities that are characteristic of real popu- 



lations. Because (2.1) is implicit in the results of Fisher and Kimura, therefore, 



we expect these results to be quite robust to numerous biological complexities. 



Furthermore, the convergence we have demonstrated between (2.1), (2.3) and 



(2.5) lead us to believe that the classical error threshold result is similarly ro- 
bust, although it does not appear to perform as well in real time (Fig. [3]). Figure 
2] together with the plots we have posted in the ESM - and many others not 



posted - demonstrate the robustness of (2.1) (and by inference (2.2)) to a wide 
range of complexities, including finite genome effects, the effects of finite popu- 
lation size (including Muller's ratchet), epistatic interactions among mutations, 
environmental noise (random changes in fitness caused by unspecified factors), 
an evolving mutational robustness modifier, compensatory mutations whose rate 
increases with decreasing fitness, an evolving mutation rate modifier, and a frac- 
tion of mutations that are lethal. 
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APPENDIX 



The results we describe in the main text derive from two manifestations of a standard 
model of evolution described verbally under the subheading "The model". Here we 
give the mathematical details of those manifestations: 

Model in continuous fitness for Criterion 1. We let u(x,t) denote the density 
of individuals in the population with log-fitness x at time t. Mutation can create 
"jumps" in log-fitness whose size has probability density g(4>, t) at time t. Under 
selection and mutation, a population's evolution is described by: 

d 

—u(x,t) = {x — x) u(x,t) 

+°° N (A.3) 

U [ / u(x — cf),t)g((j),t)d(f> — u(x,t) 

where x = J_°° xu(x, t)dx. If we apply the standard diffusion approximation to the 
mutation term, this equation becomes 

ttMx, t) = (x - x)u(x, t) + UMu(x, t). (A.4) 
ot 

Mutation operator, M = D x -^-d x ^, where D x = \fB(rn 2 B +a 2 B )+\f D (m 2 D +o 2 D ) and 
d x = fB m B — fD m D] Jb is the fraction of all mutations that are beneficial ("beneficial 
fraction"), fjj is the deleterious fraction; mg and mu are the mean effects of beneficial 
and deleterious mutations on fitness, respectively; o\ and a 2 D are the variances in those 



effects. We multiply both sides of (A.4) by x and integrate over all x to obtain x = 
<j\ + U J_°° NLxu(x, t)dx. Under the reasonable assumption that u(x, t) has compact 
support in x, integration by parts gives x = a 2 + U (S x ), where (5 X ) = fs^riB — Id^d- 
The condition x < reflects the disabling of adaptive evolution and is met when 

-u(s x )><4. 

Model in discrete fitness for Criterion 2. As an indication of the amount of 
order in the system at hand, we would like to know the frequency of the fittest genotype 
relative to its mutational neighbors. The dynamics of this genotype and its mutational 
neighbors (genotypes that differ from the fittest genotype by mutation) are given by 
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this set of equations: 

L 

Ui = (xi — XjUj)ui — LfiUi + u ii (A-5) 

where no = the frequency of the fittest genotype (the order parameter), n, = the 
frequency of mutational neighbor, i = 1,2, 3, L, X{ = fitness of genotype i, and \i = 
point mutation rate. 

The equation for the fittest genotype no may be written as: 

uo = (xq - x)(l- u )u - fiLu + m(1 - «o), (A.6) 

where xq is the fitness of the fittest genotype and x is the average fitness of everybody 

L 

else: x = x j u j/0- ~ u o)- We noted that xo — x is not relative fitness; a possible 
interpretation of the value xq — x is that it is the reproductive "payoff" in a game 
played by the fittest genotype against everybody else. 

To find the "sufficient condition" for neutralizing natural selection, we solve for the 
H that satisfies uq < and then make the substition \xL = Ufo, giving the condition: 

Uf D > ( XQ -x)(l-^^\ \ (A.7) 

where xq — x = (xq — x)(l — uo) and x is simply the mean fitness of the population. 
In a continually adapting population, uq will be small most of the time, and x ~ x. 



Thus, for large L and L ^> N , (A.7) may be approximated by XJfo ^ xq — x. 

To find the "sufficient and necessary conditions", we derive the order parameter 
?io, the equilibrium value of uq that satisfies Uq = 0. Analysis of this equilibrium 
reveals that indeed the order parameter no decreases with increasing mutation rate. 
The approach of no has a sharp inflection point whose position is found by setting 
<9 3 no/<9/i 3 = 0; for large genome size L, it occurs at a critical mutation rate given 
by U c fu = /j, c L = xq — x, where U c is the critical genomic mutation rate, fjj is the 
deleterious fraction, u c is the critical point mutation rate. In a finite population, xq 
is the maximum fitness found in the population, and x = ^X^es 2 '*' where N is 
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population size and S is the subset of the population that has fitness less than the 
maximum. This is the error-threshold result (|17l [T8]) in new guise and may be phrased 
as follows: natural selection is neutralized when: 



Uf D >x -x. (A.8) 

Calculating the "sufficient and somewhat necessary" conditions under 
Criterion 1. To compute the "sufficient and somewhat necessary" conditions requires 
projection of cumulants Kj(r) over a period of r generations into the future. Recurrence 
relations that do this are developed in (|28j) : 



The terms of the sum in (2.2) are computed from the recurrence relation: Kj(r) = 
Kj(r — 1) + Kj + i(r — 1) + Urrii for all i > 1, where k%(t) is the i th cumulant in fitness 
at a time r generations from now, U is genomic mutation rate and rrii is the i th raw 
moment of the distribution of mutational effects on fitness. 



The practical implementation of condition (2.2) requires some care. The procedure 
outlined in (|28|) provides methods for estimating the rrij. These parameters cannot be 
estimated separately from U; only their products Urrij can be estimated, if the equa- 
tions are left in non-parametric form. The obvious remedy is to make the equations 
parametric by writing the known expressions for the moments of an assumed distri- 
bution in place of rrij. Then, the parameters to be estimated are U and the limited 
number of parameters of the assumed distribution, and U can then be estimated sep- 
arately. If one's objective is to monitor a population's risk of extinction, or to drive a 
population extinct through mutagenesis, however, a less obvious remedy may apply. In 
such cases, absolute mutation rates may be irrelevant, and the effects of an increased 
(or decreased) mutation rate can be predicted by simply multiplying the estimates of 
Urrij by the factor by which mutation rate is increased (or decreased). In such cases, 
therefore, the equations may be left in non-parametric form. 
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Figure Captions 

Figure 1. Time- averaged fitness gradients from simulations as a function of genomic 
mutation rate. Simulations are fully stochastic and individual-based; populations are 
asexual. Population size is 10,000; fraction of mutations that are deleterious is con- 
stant at 0.1; fraction of mutations that are beneficial is constant at (green), 10 -5 
(orange), 10 -4 (blue), and 10 -3 (red); effects of mutations are drawn at random from 
an exponential distribution with mean 0.03. At high enough mutation rates, the rate 
of fitness increase becomes negative (indicating persistent fitness decline), from which 
inference of eventual extinction seems reasonable. 



Figure 2. [A) Time-averaged adaptation rate as a function of genomic mutation rate. 
The point at which adaptation rate becomes negative marks the threshold mutation 
rate, indicated by the blue vertical line. (B) Predictions for the threshold mutation 



rate. Yellow triangles plot the variance threshold given by Eq. (2.1); blue diamonds 



plot the error threshold given by Eq. (2.4); red line plots genomic mutation rate U. 
Where threshold predictions intersect with the red line marks the predicted threshold 
mutation rate for these simulations, and coincides exactly with the observed threshold 
mutation rate in panel (A). Each point represents an average taken over the full time 
course of eight fully stochastic, individual-based simulations of evolving asexual pop- 
ulations. Population size was 10,000, fraction of mutations that were beneficial and 
deleterious were 0.001 and 0.5, respectively. Ten percent of deleterious mutations were 
lethal; otherwise, beneficial and deleterious mutations were drawn from an exponen- 
tial distribution with mean 0.03. Epistasis among deleterious mutations is synergistic 
with epistasis parameter 0.1 and epistasis exponent 5 (see ESM for epistasis function). 
Several similar plots with different sets of biological complexities are posted in the ESM. 



Figure 3. Real-time application of the different thresholds. Each panel plots a single 
representative simulation run. Simulations and parameters are the same as those in Fig. 
[IJ with beneficial fraction set at 10 -4 . (A,B) Variance threshold given by Eq. ( |2.1[ ). 
Every 100 generations, fitness variance is measured and U is set equal to —0.9a^,/ (5 X ) 
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(Panel A) and — l.la^,/ (5 X ) (Panel B). (C,D) Variance-projection threshold given by 



Eq. (2.2). Every 500 generations, fitness measurements are used to compute cu- 
mulants Ki(r) from which the K2{t) are calculated (Appendix), and U is set equal 
to -0.9^^2(^/(4) (Panel C) and -l.l£ £ « 2 (r)/ (5 X ) (Panel D). (E,F) Error 

r=0 r=0 

threshold given by Eq. (2.4). Every 100 generations, U is set equal to 0.9(:co — x)/ fo 
(Panel E) and l.l(xo — 5)//d (Panel F). Our Criterion 1 appears to perform better 
for such real-time application than Criterion 2. 



Figure 4. Schematic of how adaptive evolution and/or natural selection may be neu- 
tralized not as a result of increasing the mutation rate but as a result of a decreasing 
threshold mutation rate. The red line indicates the mutation rate of the population; 
the black line plots the threshold mutation rate as a function of the fraction of mu- 
tations that are beneficial (horizontal axis). The big blue arrow indicates that as a 
population adapts in a static environment, its supply of beneficial mutations is used 
up, resulting in a decreasing fraction of mutations that are beneficial. As this fraction 
decreases, the threshold mutation rate decreases, until eventually the threshold muta- 
tion rate is below the mutation rate of the population. If this relation is sustained, it 
seems reasonable to infer that extinction would ensue. 
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Table 1: 


Summary of neutralizing 


conditions 


Neutralizes: 


Adaptive evolution 
(Criterion 1) 


Natural selection 
(Criterion 2) 


Sufficient: 


-U (S x ) > ol 
persistently 


Uf D >x -x 
persistently* 


Sufficient and 

somewhat necessary: 


-t/(4>>;E4) 

intermittently 




Sufficient and necessary: , ^ > ° x 

long-term average 


Uf D >x -x 
long-term steady state 



* This condition holds when L > N 
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Figure 1 
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Threshold mutation rate: 
predicted (bottom) and observed (top) 




Figure 2 
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Variance threshold Variance-projection threshold Error threshold 

10% under threshold C 10% under threshold ^ 10% under threshold 
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Figure 3 
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Threshold mutation rate 



Beneficial fraction decreases as 
^ population adapts, lowering threshold 




0.07 



0.001 0.01 

Beneficial fraction 



Figure 4 
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